confFuse: High-Confidence Fusion Gene Detection across Tumor Entities
نویسندگان
چکیده
Background: Fusion genes play an important role in the tumorigenesis of many cancers. Next-generation sequencing (NGS) technologies have been successfully applied in fusion gene detection for the last several years, and a number of NGS-based tools have been developed for identifying fusion genes during this period. Most fusion gene detection tools based on RNA-seq data report a large number of candidates (mostly false positives), making it hard to prioritize candidates for experimental validation and further analysis. Selection of reliable fusion genes for downstream analysis becomes very important in cancer research. We therefore developed confFuse, a scoring algorithm to reliably select high-confidence fusion genes which are likely to be biologically relevant. Results: confFuse takes multiple parameters into account in order to assign each fusion candidate a confidence score, of which score ≥8 indicates high-confidence fusion gene predictions. These parameters were manually curated based on our experience and on certain structural motifs of fusion genes. Compared with alternative tools, based on 96 published RNA-seq samples from different tumor entities, our method can significantly reduce the number of fusion candidates (301 high-confidence from 8,083 total predicted fusion genes) and keep high detection accuracy (recovery rate 85.7%). Validation of 18 novel, high-confidence fusions detected in three breast tumor samples resulted in a 100% validation rate. Conclusions: confFuse is a novel downstream filtering method that allows selection of highly reliable fusion gene candidates for further downstream analysis and experimental validations. confFuse is available at https://github.com/Zhiqin-HUANG/confFuse.
منابع مشابه
Production and Evaluation of Polyclonal Rabbit Anti-Human p53 Antibody Using Bacterially Expressed Glutathione S-transferase-p53 fusion protein
p53 is a key tumor suppressor gene that is targeted for inactivation during human tumorigenesis. In this study, we produced and characterized polyclonal antihuman p53 antibody. The cDNA encoding the completehuman p53 protein was cloned into pGEX-4T-1 and expressed in Escherichia coli as a fusion protein with Schistosoma japonicum glutathione S-transferase (GST). The rabbits were immunized...
متن کاملGene Expression, Single Nucleotide Variant and Fusion Transcript Discovery in Archival Material from Breast Tumors
Advantages of RNA-Seq over array based platforms are quantitative gene expression and discovery of expressed single nucleotide variants (eSNVs) and fusion transcripts from a single platform, but the sensitivity for each of these characteristics is unknown. We measured gene expression in a set of manually degraded RNAs, nine pairs of matched fresh-frozen, and FFPE RNA isolated from breast tumor ...
متن کاملDetection of abl/bcr Fusion Gene in Patients Affected by Chronic Myeloid Leukaemia by Dual-Colour Interphase Fluorescence in situ Hybridisation
Conventional cytogenetic is the standard technique for detection of Philadelphia (Ph) chromosome in chronic myeloid leukemia (CML). Evaluation of abelson murine leukemia/breakpoint cluster region (abl/bcr) fusion using dual-colour fluorescence in situ hybridization (D-FISH) is an alternative approach allowing rapid and reliable detection of the disease. We employed the technique of interphase D...
متن کاملFrequency of BCR-ABL Fusion Transcripts in Iranian Azeri Turkish patients with Chronic Myeloid Leukemia
Background: The Philadelphia chromosome (Ph) characterized by t (9; 22) (q34; q11.2) is a reciprocal translocation giving rise to a chimeric BCR-ABL fusion gene. Incidence of Ph chromosome is over 98% in Patients with Chronic Myeloid Leukemia (CML) and around 20% in acute lymphoblastic leukemia (ALL). The finding of this fusion gene is essential for diagnosis of CML by detection of various fusi...
متن کاملCharacterization of Iranian Avian Metapneumovirus based on Fusion Gene (F)
Avian metapneumovirus (aMPV) represents one of the most prevalent diseases of poultry mainly in combination with other pathogens, and it is increasing among chickens. In the present study, the detection and characterization of an aMPV subtype B strain circulating in broiler flocks based on fusion (F) gene. In phylogenetic analysis, the isolates are located in B subtype cl...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
عنوان ژورنال:
دوره 8 شماره
صفحات -
تاریخ انتشار 2017